Picture for Gokhan Tur

Gokhan Tur

Bilkent University, Ankara, Turkey

PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents

Add code
May 02, 2025
Viaarxiv icon

TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons

Add code
Apr 28, 2025
Viaarxiv icon

ToolRL: Reward is All Tool Learning Needs

Add code
Apr 16, 2025
Viaarxiv icon

YourBench: Easy Custom Evaluation Sets for Everyone

Add code
Apr 02, 2025
Viaarxiv icon

Persuade Me if You Can: A Framework for Evaluating Persuasion Effectiveness and Susceptibility Among Large Language Models

Add code
Mar 03, 2025
Viaarxiv icon

SMART: Self-Aware Agent for Tool Overuse Mitigation

Add code
Feb 17, 2025
Figure 1 for SMART: Self-Aware Agent for Tool Overuse Mitigation
Figure 2 for SMART: Self-Aware Agent for Tool Overuse Mitigation
Figure 3 for SMART: Self-Aware Agent for Tool Overuse Mitigation
Figure 4 for SMART: Self-Aware Agent for Tool Overuse Mitigation
Viaarxiv icon

Can a Single Model Master Both Multi-turn Conversations and Tool Use? CALM: A Unified Conversational Agentic Language Model

Add code
Feb 12, 2025
Viaarxiv icon

Towards Preventing Overreliance on Task-Oriented Conversational AI Through Accountability Modeling

Add code
Jan 17, 2025
Viaarxiv icon

Large Language Models as User-Agents for Evaluating Task-Oriented-Dialogue Systems

Add code
Nov 15, 2024
Viaarxiv icon

ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents

Add code
Nov 01, 2024
Figure 1 for ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents
Figure 2 for ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents
Figure 3 for ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents
Figure 4 for ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents
Viaarxiv icon